Search CORE

17 research outputs found

Injection of Knowledge in a Sourcing Recommender System

Author: Faron Catherine
Tettamanzi Andrea
Tounsi Dhouib Molka
Publication venue: HAL CCSD
Publication date: 14/12/2020
Field of study

International audienceRecommender systems provide suggestions to users for items that best meet their needs. In this work, we study the benefits of using knowledge and, more specifically, a 'bag of concepts' representation to enhance a recommender system in the sourcing domain. We tested our approach in a real-world case study provided by the Silex company. The experimental results show that injecting knowledge in the recommendation process outperforms word embedding approaches

INRIA a CCSD electronic archive server

Measuring Clusters of Labels in an Embedding Space to Refine Relations in Ontology Alignment

Author: Faron Catherine
Tettamanzi Andrea G. B.
Tounsi Dhouib Molka
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 16/10/2021
Field of study

International audienceOntology alignment plays a key role in the management of heterogeneous data sources and metadata. In this context, various ontology alignment techniques have been proposed to discover correspondences between the entities of different ontologies. This paper proposes a new ontology alignment approach based on a set of rules exploiting the embedding space and measuring clusters of labels to discover the relationship between entities. We tested our system on the OAEI conference complex alignment benchmark track and then applied it to aligning ontologies in a real-world case study. The experimental results show that the combination of word embedding and a measure of dispersion of the clusters of labels, which we call the radius measure, makes it possible to determine, with good accuracy, not only equivalence relations, but also hierarchical relations between entities

INRIA a CCSD electronic archive server

A Hybrid Bi-LSTM-CRF Model for Sequence Labeling Applied to the Sourcing Domain

Author: Daoud Hasnaa
Faron Catherine
Rancati Jerôme
Tettamanzi Andrea
Tounsi Dhouib Molka
Publication venue: HAL CCSD
Publication date: 29/06/2020
Field of study

National audienceIn a number of areas, companies are often faced with the task of dealing with large amounts of textual customers' requests. Automating information extraction like key phrases from customers' requests can help to accelerate the processing process. Silex France is currently facing this challenge in the context of processing sourcing requests.In this article, we share our sequence labeling results based on a hybrid method Bi-LSTM-CRF, in an industrial context. This work was integrated in the B2B Silex platform for service providers recommendation. Experiments with the B2B Silex platform data show that, with a good choice of features to extract and optimal choice of hyper-parameters, the combination of the Bi-LSTM and CRF helps to achieve good results even in a context of small data. Indeed, the textual content processed is in the form of complete sentences generated by users, and thus is subject to typing errors. To handle this type of data we combine several types of extracted features describing the textual content such as: (i) semantics, (ii) syntax, (iii) word characters, (iv) position of words.Dans un certain nombre de domaines, les entreprises sont souvent confrontées à la tâche de traiter au quotidien des quantités importantes de demandes textuelles. L'extraction automatique des informations clés à partir des demandes clients, peut aider à accélérer le processus de traitement. Silex France est aujourd'hui confrontée à ces enjeux dans le cadre du traitement des demandes de sourcings. Dans cet article, nous partageons nos résultats d'étique-tage de séquences en nous basant sur une méthode hybride BiLSTM-CRF, dans un contexte industriel. Le travail est in-tégré dans la plate-forme B2B Silex pour la recommanda-tion des prestataires de services. Les expériences faites sur les données de la plateforme B2B Silex montrent qu'avec un bon choix de features à extraire et des hyperparamètres, la combinaison du modèle Bi-LSTM-CRF permet de réus-sir l'extraction d'infomation à partir des demandes tex-tuelles, même dans un contexte de petites données (small data). En effet, le contenu textuel traité est sous forme de phrases complètes générées par des utilisateurs, et est ainsi exposé à des erreurs de frappe. Pour gérer ce type de don-nées, nous combinons plusieurs types de features extraites décrivant le contenu textuel tels que : (i) la sémantique, (ii) la syntaxe, (iii) les caractères des mots, (iv) la position des mots

INRIA a CCSD electronic archive server

Knowledge Graphs Evolution and Preservation -- A Technical Report from ISWS 2019

Author: Ahmad Sakor
Alba Catalina Morales Tirado
Alessandro Umbrico
Allard Oelen
Amine Dadoun
Aneta Koleva
Anna Nguyen
Ariam Rivas Mendez
Axel Polleres
Bilal Koteich
Chang Sun
Chuangtao Ma
Claudia d'Amato
Eleonora Marzi
Fabio Mariani
Federico Igne
Felix Bensmann
Frances Gillis-Webber
Francesca Alloatti
Francesca Giovannetti
Genet Asefa Gesese
Gianmarco Spinaci
Glenda Amaral
Harald Sack
Harm Delva
Heiko Paulheim
Irene Celino
Ismail Harrando
Ivan Heibi
Jaime Salas
Jan Portisch
John Domingue
Kabul Kurniawan
Kader Pustu-Iren
Kholoud Alghamdi
Laurine Huber
Lientje Maas
Ling Cai
Luigi Asprino
Maheshkumar Mistry
Marc Gallofré Ocaña
Margherita Porena
Marieke van Erp
Martin Beno
Martin Mansfield
Marìa Granados Buey
Meilin Shi
Mengya Liu
Michalis Georgiou
Michel Dumontier
Mohamad Yaser Jaradeh
Molka Tounsi Dhouib
Mortaza Alinam
Nacira Abbas
Neha Keshan
Omaima Fallatah
Paola Espinoza Arias
Riley Capshaw
Russa Biswas
Sebastian Rudolph
Sebastián Ferrada
Sepideh Mesbah
Soheil Roshankish
Stefano De Giorgis
Tabea Tietz
Thomas Schleider
Valentina Anita Carriero
Valentina Pasqual
Valentina Presutti
Viet Bach Nguyen
Vincent Emonet
Vitor Horta
Weiqin Xu
Wouter van den Berg
Publication venue
Publication date: 01/01/2020
Field of study

One of the grand challenges discussed during the Dagstuhl Seminar "Knowledge Graphs: New Directions for Knowledge Representation on the Semantic Web" and described in its report is that of a: "Public FAIR Knowledge Graph of Everything: We increasingly see the creation of knowledge graphs that capture information about the entirety of a class of entities. [...] This grand challenge extends this further by asking if we can create a knowledge graph of "everything" ranging from common sense concepts to location based entities. This knowledge graph should be "open to the public" in a FAIR manner democratizing this mass amount of knowledge." Although linked open data (LOD) is one knowledge graph, it is the closest realisation (and probably the only one) to a public FAIR Knowledge Graph (KG) of everything. Surely, LOD provides a unique testbed for experimenting and evaluating research hypotheses on open and FAIR KG. One of the most neglected FAIR issues about KGs is their ongoing evolution and long term preservation. We want to investigate this problem, that is to understand what preserving and supporting the evolution of KGs means and how these problems can be addressed. Clearly, the problem can be approached from different perspectives and may require the development of different approaches, including new theories, ontologies, metrics, strategies, procedures, etc. This document reports a collaborative effort performed by 9 teams of students, each guided by a senior researcher as their mentor, attending the International Semantic Web Research School (ISWS 2019). Each team provides a different perspective to the problem of knowledge graph evolution substantiated by a set of research questions as the main subject of their investigation. In addition, they provide their working definition for KG preservation and evolution

Archivio istituzionale della ricerca - Università di Bari

Ingénierie des connaissances dans le domaine du sourcing pour la recommandation de prestataires

Author: Tounsi Dhouib Molka
Publication venue: HAL CCSD
Publication date: 26/03/2021
Field of study

This CIFRE doctoral thesis is part of a collaborative research project between the I3S laboratory of the University of Côte d'Azur and the Silex company, and addresses the field of recommendation systems. Silex is a start-up that develops a Software-as-a-Service sourcing tool that allows companies to provide a description of their professional activities, their offers and/or the services they are looking for in natural language (currently French).In this context, the objective of this thesis is to propose a decision support system by exploiting the semantic knowledge that are extracted from the textual descriptions of requests for services and providers, in order to recommend relevant providers for a service request.The contributions of this thesis are the following. First, we proposed a vocabulary for the sourcing field by reusing and integrating existing vocabularies, in order to semantically annotate the textual descriptions of providers and requests for services. Second, we proposed an automatic alignment method to establish the correspondence between different concepts of the considered vocabularies. This approach is based on rules exploiting embedding space and measurements on groups of labels to discover the relationships between concepts. Third, we proposed an algorithm for extracting named entities from the textual descriptions of service requests and providers, and an algorithm for semantic annotation of these descriptions, based on the linking of the extracted entities with the concepts of the defined vocabulary.Fourth, we proposed a provider recommendation algorithm that exploits these knowledges extracted.Finally, we studied the contribution of using ontological knowledge to improve our decision support system for the sourcing domain in order to recommend relevant providers for a service request.The contributions of this thesis are the following. First, we proposed a vocabulary for the sourcing field in order to semantically annotate the textual descriptions of providers and requests for services. This vocabulary was built by reusing and integrating existing vocabularies. Second, we proposed an automatic alignment method to establish the correspondence between different concepts of the considered vocabularies. This approach is based on rules exploiting embedding space and measurements on groups of labels to discover the relationships between concepts. Third, we proposed an algorithm for extracting named entities from the textual descriptions of service requests and providers, and an algorithm for semantic annotation of these descriptions, based on the linking of the extracted entities with the concepts of the defined vocabulary.Fourth, we proposed a provider recommendation algorithm that exploits these knowledge extracted.Finally, we studied the contribution of using ontological knowledge to improve our decision support system for the sourcing domain.Cette thèse de doctorat CIFRE s'inscrit dans le cadre d'un projet de recherche collaboratif entre le laboratoire I3S de l'Université Côte d'Azur et la société Silex et aborde le domaine des systèmes de recommandation. Silex est une start-up qui développe un outil de sourcing Software-as-a-Service permettant aux entreprises de fournir une description de leurs activités professionnelles, de leurs offres et/ou des services qu'elles recherchent en langue naturelle (actuellement le français).Dans ce contexte, l'objectif de cette thèse est de proposer un système d'aide à la décision en exploitant les connaissances sémantiques extraites à partir des descriptions textuelles des demandes de prestation et des prestataires, afin de recommander des prestataires pertinents pour une demande de prestation.Les contributions de cette thèse sont les suivantes. Premièrement, nous avons proposé un vocabulaire pour le domaine du sourcing en réutilisant et en intégrant des vocabulaires existants, afin d'annoter sémantiquement les descriptions textuelles des prestataires et des demandes de prestation. Deuxièmement, nous avons proposé une méthode d’alignement automatique afin d'établir la correspondance entre différents concepts des vocabulaires considérés. Cette approche se base sur des règles exploitant l'espace des plongements lexicaux et des mesures sur des groupes d'étiquettes pour découvrir les relations entre concepts. Troisièmement, nous avons proposé un algorithme d'extraction des entités nommées à partir des descriptions textuelles des demandes de prestation et des prestataires et un algorithme d'annotation sémantique de ces descriptions, basé sur le liage des entités extraites avec les concepts du vocabulaire défini.Quatrièmement, nous avons proposé un algorithme de recommandation de prestataires qui exploite ces annotations sémantiques.Finalement, nous avons étudié l'apport de l'utilisation de connaissances ontologiques afin d'améliorer notre système d'aide à décision pour le domaine du sourcing

INRIA a CCSD electronic archive server

Knowledge engineering in the sourcing domain for the recommendation of providers

Author: Tounsi Dhouib Molka
Publication venue
Publication date: 26/03/2021
Field of study

Cette thèse de doctorat CIFRE s'inscrit dans le cadre d'un projet de recherche collaboratif entre le laboratoire I3S de l'Université Côte d'Azur et la société Silex et aborde le domaine des systèmes de recommandation. Silex est une start-up qui développe un outil de sourcing Software-as-a-Service permettant aux entreprises de fournir une description de leurs activités professionnelles, de leurs offres et/ou des services qu'elles recherchent en langue naturelle (actuellement le français).Dans ce contexte, l'objectif de cette thèse est de proposer un système d'aide à la décision en exploitant les connaissances sémantiques extraites à partir des descriptions textuelles des demandes de prestation et des prestataires, afin de recommander des prestataires pertinents pour une demande de prestation.Les contributions de cette thèse sont les suivantes. Premièrement, nous avons proposé un vocabulaire pour le domaine du sourcing en réutilisant et en intégrant des vocabulaires existants, afin d'annoter sémantiquement les descriptions textuelles des prestataires et des demandes de prestation. Deuxièmement, nous avons proposé une méthode d’alignement automatique afin d'établir la correspondance entre différents concepts des vocabulaires considérés. Cette approche se base sur des règles exploitant l'espace des plongements lexicaux et des mesures sur des groupes d'étiquettes pour découvrir les relations entre concepts. Troisièmement, nous avons proposé un algorithme d'extraction des entités nommées à partir des descriptions textuelles des demandes de prestation et des prestataires et un algorithme d'annotation sémantique de ces descriptions, basé sur le liage des entités extraites avec les concepts du vocabulaire défini.Quatrièmement, nous avons proposé un algorithme de recommandation de prestataires qui exploite ces annotations sémantiques.Finalement, nous avons étudié l'apport de l'utilisation de connaissances ontologiques afin d'améliorer notre système d'aide à décision pour le domaine du sourcing.This CIFRE doctoral thesis is part of a collaborative research project between the I3S laboratory of the University of Côte d'Azur and the Silex company, and addresses the field of recommendation systems. Silex is a start-up that develops a Software-as-a-Service sourcing tool that allows companies to provide a description of their professional activities, their offers and/or the services they are looking for in natural language (currently French).In this context, the objective of this thesis is to propose a decision support system by exploiting the semantic knowledge that are extracted from the textual descriptions of requests for services and providers, in order to recommend relevant providers for a service request.The contributions of this thesis are the following. First, we proposed a vocabulary for the sourcing field by reusing and integrating existing vocabularies, in order to semantically annotate the textual descriptions of providers and requests for services. Second, we proposed an automatic alignment method to establish the correspondence between different concepts of the considered vocabularies. This approach is based on rules exploiting embedding space and measurements on groups of labels to discover the relationships between concepts. Third, we proposed an algorithm for extracting named entities from the textual descriptions of service requests and providers, and an algorithm for semantic annotation of these descriptions, based on the linking of the extracted entities with the concepts of the defined vocabulary.Fourth, we proposed a provider recommendation algorithm that exploits these knowledges extracted.Finally, we studied the contribution of using ontological knowledge to improve our decision support system for the sourcing domain in order to recommend relevant providers for a service request.The contributions of this thesis are the following. First, we proposed a vocabulary for the sourcing field in order to semantically annotate the textual descriptions of providers and requests for services. This vocabulary was built by reusing and integrating existing vocabularies. Second, we proposed an automatic alignment method to establish the correspondence between different concepts of the considered vocabularies. This approach is based on rules exploiting embedding space and measurements on groups of labels to discover the relationships between concepts. Third, we proposed an algorithm for extracting named entities from the textual descriptions of service requests and providers, and an algorithm for semantic annotation of these descriptions, based on the linking of the extracted entities with the concepts of the defined vocabulary.Fourth, we proposed a provider recommendation algorithm that exploits these knowledge extracted.Finally, we studied the contribution of using ontological knowledge to improve our decision support system for the sourcing domain

Theses.fr

Ingénierie des connaissances dans le domaine du sourcing pour la recommandation de prestataires

Author: Tounsi Dhouib Molka
Publication venue: HAL CCSD
Publication date: 26/03/2021
Field of study

Thèses en Ligne

INRIA a CCSD electronic archive server

HAL Descartes

Comparaison de l’utilisation d’un référentiel standard et d’un référentiel interne pour la recommandation d’objets d’apprentissage

Author: Faron Catherine
Rodríguez Rocha Oscar
Tounsi Dhouib Molka
Publication venue: HAL CCSD
Publication date: 27/06/2022
Field of study

National audienceAvec la transformation numérique, l’adaptation et le développ-ement des compétences sont devenus des facteurs majeurs pour améliorerles performances des collaborateurs et des entreprises. Comprendre lesbesoins des collaborateurs et les aider à atteindre leurs objectifs de déve-loppement de carrière est un véritable défi aujourd’hui. Dans ce travail,nous partageons notre expérience pour mettre en place un système derecommandation automatique permettant aux apprenants de trouver desobjets d’apprentissage pertinents en fonction de leurs objectifs d’appren-tissage. Cette tâche de mise en correspondance se fonde principalementsur la détermination de la similarité sémantique entre les objectifs d’ap-prentissage organisés sous forme de thésaurus et le texte des objets d’ap-prentissage. Nous avons étudié l’impact de l’adaptation et l’utilisationd’un référentiel existant par rapport à la construction d’un référentielinterne à l’entreprise pour définir les objectifs d’apprentissage. Les résul-tats des expérimentations montrent que la création et l’utilisation d’unréférentiel interne à l’entreprise est plus bénéfique que l’utilisation d’unréférentiel existant dans notre cas d’usage

INRIA a CCSD electronic archive server

Recommandation d'objets d'apprentissage basée sur des objectifs d'apprentissage en utilisant les modèles de plongement de phrases

Author: Faron Catherine
Rodríguez Rocha Oscar
Tounsi Dhouib Molka
Publication venue: HAL CCSD
Publication date: 01/01/2022
Field of study

National audienceWith the digital transformation, adaptation and development of skills have become key factors in improving employee and company performance. Understanding the needs of employees and helping them achieve their career development goals is a real challenge today. In this work, we share our experience to implement an automatic recommendation system that allows learners to find relevant learning objects according to their learning goals. This matching task is mainly based on determining the semantic similarity between goals and the text of learning objects. We comparatively evaluated three state-of-the-art pre-trained sentence embeddings models for the learning object recommendation task.Experimental results show that using sentence embeddings models in the recommendation process outperforms the Elasticsearch BM25 model generally used in industry.Avec la transformation numérique, l'adaptation et le développement des compétences sont devenus des facteurs majeurs pour améliorer les performances des collaborateurs et des entreprises. Comprendre les besoins des collaborateurs et les aider à atteindre leurs objectifs de développement de carrière est un véritable défi aujourd'hui. Dans ce travail, nous partageons notre expérience pour mettre en place un système de recommandation automatique permettant aux apprenants de trouver des objets d'apprentissage pertinents en fonction de leurs objectifs d'apprentissage. Cette tâche de mise en correspondance se base principalement sur la détermination de la similarité sémantique entre les objectifs et le contenu textuel des objets d'apprentissage. Nous avons évalué de manière comparative trois modèles pré-entraînés de plongement de phrases de l'état de l'art pour la tâche de la recommandation d'objet d'apprentissage. Les résultats des expérimentations montrent que l'utilisation de ces modèles de plongement de phrases dans le processus de recommandation est plus performante que le modèle BM25 d'Elasticsearch classiquement utilisé dans l'industrie. Mots-clés Apprentissage intelligent, Recommandation de cours de formation pour les collaborateurs, Systèmes de recommandation

INRIA a CCSD electronic archive server